Decision tree-based simultaneous clustering of phonetic contexts, dimensions, and state positions for acoustic modeling
نویسندگان
چکیده
In this paper, a new decision tree-based clustering technique called Phonetic, Dimensional and State Positional Decision Tree (PDS-DT) is proposed. In PDS-DT, phonetic contexts, dimensions and state positions are grouped simultaneously during decision tree construction. PDS-DT provides a complicate distribution sharing structure without any external control parameters. In speaker-independent continuous speech recognition experiments, PDS-DT achieved about 13%–15% error reduction over the phonetic decision tree-based state-tying technique.
منابع مشابه
Decision tree distribution tying based on a dimensional split technique
In this paper, a new clustering technique called Dimensional Split Phonetic Decision Tree (DS-PDT) is proposed. In DSPDT, state distributions are split dimensionally when applying phonetic question. This technique is an extension of the decision tree based acoustic modeling. It gives a proper context-dependent sharing structure of each dimension automatically while maintaining the correlations ...
متن کاملDecision Tree Distribution Tying Bas Technique
In this paper, a new clustering technique called Dimensional Split Phonetic Decision Tree (DS-PDT) is proposed. In DSPDT, state distributions are split dimensionally when applying phonetic question. This technique is an extension of the decision tree based acoustic modeling. It gives a proper context-dependent sharing structure of each dimension automatically while maintaining the correlations ...
متن کاملRobust decision tree state tying for continuous speech recognition
In this paper, methods of improving the robustness and accuracy of acoustic modeling using decision tree based state tying are described. A new two-level segmental clustering approach is devised which combines the decision tree based state tying with agglomerative clustering of rare acoustic phonetic events. In addition, a unified maximum likelihood framework for incorporating both phonetic and...
متن کاملUnified framework for acoustic topology modelling: ML-SSS and question-based decision trees
State-shared, context-dependent, acoustic HMM's are the basis of practically all large-vocabulary state-of-the-art speech recognition systems. The topology, i.e. state-sharing, is usually trained by decision tree based clustering of similar phonetic contexts, i.e. divisive clustering on the state level. In this paper, we show that Phonetic Decision Trees (PDT) and Maximum Likelihood Successive ...
متن کاملHigh resolution decision tree based acoustic modeling beyond CART
In this paper, an m-level optimal subtree based phonetic decision tree clustering algorithm is described. Unlike prior approaches, the m-level optimal subtree in the proposed approach is to generate log likelihood estimates using multiple mixture Gaussians for phonetic decision tree based state tying. It provides a more accurate model of the log likelihood variations in node splitting and it is...
متن کامل